Journal of Beijing University of Posts and Telecommunications

  • EI Core Journal

Journal of Beijing University of Posts and Telecommunications, 2009, Vol. 32, Issue 1: 65-68. doi: 10.13190/jbupt.200901.65.mayl

• Papers

Network Flow Identification Based on Machine Learning

Guochu Shou, Yihong Hu

  • Received: 2008-05-28 Online: 2009-01-28 Published: 2009-01-28

Abstract:

A machine learning approach based on the C4.5 decision tree algorithm is proposed for network traffic identification. The correlation feature selection algorithm and a genetic algorithm are used to select the attribute feature subset. N-fold cross-validation is combined with a separate testing set to assess the classification results on current national broadband network traffic. Experiments demonstrate that network traffic can be identified and analyzed successfully without knowing the port numbers or application-layer protocol labels of the network flows in advance.
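
The following is a minimal sketch of two ideas named in the abstract: the gain ratio criterion that C4.5 uses to choose a split attribute, and an evaluation that reports both an N-fold cross-validation score and a score on a separate testing set. It is not the authors' implementation. The flow features (`size`, `duration`), the class labels (`web`, `p2p`), the toy data, and all function names are hypothetical. A one-level decision stump stands in for a full C4.5 tree, with no recursion, pruning, or handling of continuous attributes, and the feature selection step is not shown.

```python
import math
from collections import Counter


def entropy(items):
    """Shannon entropy in bits of a list of discrete values."""
    total = len(items)
    return -sum((n / total) * math.log2(n / total) for n in Counter(items).values())


def gain_ratio(values, labels):
    """Gain ratio of one discrete attribute: information gain / split information."""
    total = len(labels)
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


def fit_stump(rows, labels):
    """Choose the attribute with the highest gain ratio; map each value to its majority class."""
    best = max(range(len(rows[0])), key=lambda a: gain_ratio([r[a] for r in rows], labels))
    by_value = {}
    for r, y in zip(rows, labels):
        by_value.setdefault(r[best], []).append(y)
    table = {v: Counter(ys).most_common(1)[0][0] for v, ys in by_value.items()}
    default = Counter(labels).most_common(1)[0][0]  # fallback for unseen values
    return best, table, default


def predict(model, row):
    best, table, default = model
    return table.get(row[best], default)


def accuracy(model, rows, labels):
    return sum(predict(model, r) == y for r, y in zip(rows, labels)) / len(labels)


def n_fold_indices(n_samples, n_folds):
    """Yield (train_idx, val_idx) pairs; sample i goes to fold i % n_folds."""
    for k in range(n_folds):
        val = [i for i in range(n_samples) if i % n_folds == k]
        train = [i for i in range(n_samples) if i % n_folds != k]
        yield train, val


def evaluate(train_rows, train_labels, test_rows, test_labels, n_folds):
    """Return (mean cross-validation accuracy, accuracy on the separate testing set)."""
    scores = []
    for train_idx, val_idx in n_fold_indices(len(train_rows), n_folds):
        model = fit_stump([train_rows[i] for i in train_idx],
                          [train_labels[i] for i in train_idx])
        scores.append(accuracy(model,
                               [train_rows[i] for i in val_idx],
                               [train_labels[i] for i in val_idx]))
    final_model = fit_stump(train_rows, train_labels)  # refit on all training flows
    return sum(scores) / n_folds, accuracy(final_model, test_rows, test_labels)


# Hypothetical toy flows: (size bucket, duration bucket). No port numbers are used.
TRAIN_ROWS = [("small", "short"), ("small", "long"), ("large", "short"), ("large", "long"),
              ("small", "short"), ("small", "long"), ("large", "short"), ("large", "long")]
TRAIN_LABELS = ["web", "web", "p2p", "p2p", "web", "web", "p2p", "p2p"]
TEST_ROWS = [("small", "long"), ("large", "short")]
TEST_LABELS = ["web", "p2p"]

if __name__ == "__main__":
    cv_acc, test_acc = evaluate(TRAIN_ROWS, TRAIN_LABELS, TEST_ROWS, TEST_LABELS, n_folds=4)
    print("cross-validation accuracy:", cv_acc)
    print("testing-set accuracy:", test_acc)
```

Reporting the two numbers side by side is one way to read the combined assessment described above: the cross-validation score estimates how stable the classifier is on the training flows, while the testing-set score checks it against flows that were never used for fitting.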

Key words: machine learning, decision tree, flow identification